Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View

Authors

  • Wei Luo
  • Dinh Phung
  • Truyen Tran
  • Sunil Gupta
  • Santu Rana
  • Chandan Karmakar
  • Alistair Shilton
  • John Yearwood
  • Nevenka Dimitrova
  • Tu Bao Ho
  • Svetha Venkatesh
  • Michael Berk

Abstract

BACKGROUND: As more and more researchers turn to big data for new opportunities in biomedical discovery, machine learning models, as the backbone of big data analysis, are mentioned more often in biomedical journals. However, owing to the inherent complexity of machine learning methods, they are prone to misuse. Because of the flexibility in specifying machine learning models, the results are often insufficiently reported in research articles, hindering reliable assessment of model validity and consistent interpretation of model outputs.

OBJECTIVE: To attain a set of guidelines on the use of machine learning predictive models within clinical settings, to make sure the models are correctly applied and sufficiently reported so that true discoveries can be distinguished from random coincidence.

METHODS: A multidisciplinary panel of machine learning experts, clinicians, and traditional statisticians was interviewed, using an iterative process in accordance with the Delphi method.

RESULTS: The process produced a set of guidelines that consists of (1) a list of reporting items to be included in a research article and (2) a set of practical sequential steps for developing predictive models.

CONCLUSIONS: A set of guidelines was generated to enable correct application of machine learning models and consistent reporting of model specifications and results in biomedical research. We believe that such guidelines will accelerate the adoption of big data analysis, particularly with machine learning methods, in the biomedical research community.


Similar resources

Financial Reporting Fraud Detection: An Analysis of Data Mining Algorithms

In the last decade, high-profile financial frauds committed by large companies in both developed and developing countries were discovered and reported. This study compares the performance of five popular statistical and machine learning models in detecting financial statement fraud. The research objects are companies which experienced both fraudulent and non-fraudulent financial statements betw...


Determining the progression stages of liver fibrosis in patients with chronic hepatitis B

Introduction: In the long term, chronic hepatitis B (CHB) leads to liver fibrosis, liver failure, and death. The stage of fibrosis in CHB patients can also be detected based on biochemical markers. The aim of this study was to predict the state of liver fibrosis in CHB patients and determine the possibility of patients shifting from a given state to another one. Materials and Methods: This stu...


Prostate cancer radiomics: A study on IMRT response prediction based on MR image features and machine learning approaches

Introduction: To develop different radiomic models based on radiomic features and machine learning methods to predict early intensity-modulated radiation therapy (IMRT) response. Materials and Methods: Thirty prostate cancer patients were included. All patients underwent pre- and post-IMRT T2-weighted and apparent diffusion coefficient (ADC) magnetic resonance imagi...


Analyzing the performance of different machine learning methods in determining the transportation mode using trajectory data

With the widespread advent of smartphones equipped with the Global Positioning System (GPS), a huge volume of users’ trajectory data has been generated. To facilitate urban management and provide appropriate services to users, the study of these data emerged as a broad research field and has been developing ever since. In this research, the transportation mode of users’ trajectories was identi...


Predictive Model Evaluation for PHM

In the past decades, machine learning techniques, particularly classifiers, have been widely applied to various real-world applications such as PHM. In developing high-performance classifiers or machine learning-based models (i.e., predictive models for PHM), predictive model evaluation remains a challenge. Generic methods such as accuracy may not fully meet the needs of models ...



Volume: 18

Publication date: 2016